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FINAL  REPORT 


In  June,  1969,  we  began  the  development  of  an  interactive  information 
retrieval  system  that  would  allow  researchers  to  querry  a  complex,  "ragged” 
dataset  interactively  and  receive  information  in  a  form  suitable  to  their 
research  interests.  Although  it  was  expected  that  this  would  require  one 
year  to  develop,  the  system  has  been  developed  and  is  now  operating. 

We  have  developed  both  a  full  file  maintenance  system  and  a  full  file 
search  system.  These  two  in  combination  constitute  an  information  maintenance 
and  retrieval  system.  Each  part  of  the  system — update  and  search — are  de¬ 
scribed  in  this  progress  report.  A  full  explication  of  the  search  segment 
oi:  the  system  occurs  in  the  attached  manual  . 

A.  Updating  the  File: 

The  update  program  is  designed  so  that  tha  length,  width  and  content  of 
the  file  can  be  manipulated  interactively.  We  can  add  new  sets  of  informa¬ 
tion,  add  specific  infornution,  change  existing  information,  edite  any  card, 
or  delete  any  card.  During  this  process  error  messages  are  given  in  simple 
language.  A  copy  program  is  part  of  this  set  of  routines  which  enables  us  to 
create  new  files  cheaply  and  efficiently  at  no  risk  to  the  existing  file  new 
information.  As  an  example,  in  a  recent  run  involving  the  insertion  and 
editing  of  one  thousand  records  the  recopy  of  the  entire  tape  and  the  print¬ 
ing  of  the  entire  tape  cost  $15.00,  a  very  low  cost  in  view  of  the  fact  that 
the  file  consists  of  the  equivalent  of  35,000  punch  cards. 

B.  Search  Systems: 

During  the  preceding  six  months  we  designed  and  implemented  a  search 
compiler.  This  system  accepts  as  input  the  requests  of  a  researcher  who  has 
direct  communications  with  the  search  system  via  remote  computer  terminal 
in  an  online  computer  system.  The  output  from  the  search  request  is  a 
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program  in  object  code  which  is  run  directly  on  t.he  machine.  The  researcher 
can  enter  his  search  request— which  may  consist  of  any  number  of  statements 
—in  a  free  form.  He  may  also  design  his  output  format  in  any  way  that  he 
wishes,  included  in  the  system  is  the  ability  to  create  data  items  from 
information  on  the  file  as  well  as  the  ability  to  access  any  code  category 
on  the  file  with  a  unique  keyword  structure.  The  advantage  of  this  re¬ 
search  system  is  that  it  makes  the  data  base  fully  accessable  for  any  re¬ 
search  purposes  germaine  to  the  database.  The  development  of  the  search 
system  makes  it  unnecessary  for  the  researcher  to  generate  special  purpose 
programs  for  each  research  :  |uest.  in  •K''*  cases  the  request  for  output 
can  be  satisfied  within  minutes  after  the  research  request  is  formulated. 

SIGNIFICANT  SCIENTIFIC  AND  TECHNICAL  ACCOMPLISHMENTS 

1.  The  construction  of  an  update  facility  which  assures  the  integrity 

of  the  original  file.  The  technological  developments  that  allowed  this  Include 
the  development  of  a  structured  control  facility  throuqh  the  use  of  sophis¬ 
ticated  programming  techniques  based  on  Assembler  language. 

2.  The  search  segment  of  the  CARESS  compiler  has  done  for  Information 
retrieval  what  the  fortran  compilers  did  for  numerical  calculations.  The 
search  logic  implied  by  the  users  request  is  built  directly  into  the  search 
program  rather  than  into  an  intermediate  table.  This  technique  kas  not 
been  applied  in  information  retrieval.  It  is  amazingly  powerful. 

3.  The  creation  of  a  practical,  full-powered,  transferable  information 
retrieval  system  operating  within  the  time  sharing  was  done  for  a  ridiculously 
low  sum  of  money.  This  Information  retrieval  system  can  be  used  on  other  data 
bases;  It  can  be  used  on  other  machine  configurations. 
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4„  We  have  completed  three  months  ahead  of  schedule  the  development  of 
the  interactive  retrieval  system  with  all  of  the  capabilities  outlined  in  the 
grant  proposal,  A  manual  is  available  for  distribution. 

TESTING  OF  THE  SYSTEM 

We  have  now  tested  this  system  under  a  -umber  of  conditions.  The  first 
test  involved  undergraduate  students  in  a  course  on  the  Government  and 
Politics  of  Eastern  Europe.  After  a  blackboard  demonstration  of  the  database 
and  the  retrieval  program  (fifteen  minutes)  an  undergraduate  student  not 
acquainted  with  computers  or  computing  accessed  information  on  those  persons 
purged  in  Czechoslovakia.  The  information  that  he  retrieved  went  well  beyond 
the  literature.  Subsequently  a  number  of  students  have  utilized  the  system 
by  reference  to  the  manual . alone.  Two  other  data  bases  are  now  being 
formatted  in  a  similar  manner.  In  the  next  six  months  CARESS  will  be  de 
monstrated  using  portable  terminals  at  meetings  in  Atlanta,  Pittsburgh, 
Chicago,  New  York  and  San  Francisco. 

INFLUENCE  OF  SYSTEM  ON  INFORMATION  RETRIEVAL 

We  have  had  the  extensive  inquires  from  persons  interested  in  the  ^ata 
being  accessed,  and  in  the  CARESS  system.  These  inquiries  come  from  students 
and  faculty  at  other  institutions  as  well  as  persons  in  the  eductionai  a,.d 
health  fields.  We  are  able  to  demonstrate  CARESS  from  any  place  which  has 
an  operative  phone  system,  and  we  have  bee-  asked  to  do  so  at  a  wide  variety 
of  places. 
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CARESS  in  an  interactive  information  retrieval  aystem  that  allows  any  user  to 
querry  a  complex  empirical  ragged  data  set  without  expertise  in  computing,  the 
system  consists  of  a  file  maintenance  casponsnt  and  a  full  file  search  system. 
The  maintenance  system  assures  the  integrity  of  the  files;  the  search  logic 
implied  by  the  users  request  is  built  directly  into  the  search  program. 
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